
'1 Experimental Designs 


® ANOVA is very common with traditional 
designs of experiments involving 1 or 
more “factors,” with 2 or more “levels” 

• Factor 
o Level 

® Factors can be “between” or “within” 

• A.k.a. Independent/Dependant Measures 

• A.k.a. Grouping/Repeated Factors 



nJToday’s Topic (SMAR-t ): ANON 


® Analysis of Variance (i.e. ANOVA) 

• Independent Measures ANOVA 

• Repeated Measures ANOVA 

• Mixed Factorials 

• Analysis of Covariance (ANCOVA) 
o Using Covariates 



HJTypes °f Outcomes for ANON/ A 


® Continuously scaled outcomes assumed 
to follow the normal distribution, or that 
can be transformed so that it does (i.e. 
“normalized”) 

• Examples: BMI, BP, BMD, Strength, 
Standardized Scores, Viral Loads, Force, 
Averages or Sums of Likert-Scaled items 
(scale scores), Optical Density, Volume, 
Response Time, Distance, etc. 








m Central Limit Theorem 


® States that for any population with mean p 
and standard deviation a , the distribution of 
sample means with sample size n will 
approach a normal distribution with p and SD 
of as n approaches infinity. 

® REGARDLESS of the shape of the 
distribution in the population. 

® By the time sample sizes hit around 30, 

sampling distribution of means is close to 

normal. . — ■ A 


Thus. 


® Since we know so much about the Normal 
Distribution 

® And we know that sample summaries (means 
or otherwise) tend to follow that distribution 

• Even data collected from non-normal samples 

• Especially so with large sample size (big-n) 

® We can usually apply our knowledge of the 
normal distribution to statistical comparisons, 
estimates, and probability 

• As long as we do some preliminary screeni 



r J Moving to the t-test for comparing two 
samples 


Hi samples 

® Used for comparing 
two samples 
collected randomly 
from two populations 
® Many other flavors of 
the t-test exist. . . but 
we’ll start here. 


Population I 
I Sample 1 I 



Population 2 

o 

Sample 2 








m T-tests on the Computer: 

® Software gives us t-score and a p-value 
® Allowing us to test hypotheses that the two 
samples come from the same population (or 
not) 

® And describe the magnitude of the differences 
(confidence intervals) 

® Ex. t = 4 87, p<. 001 

• H nu „: Two samples are from same population 

• H alt : Two samples are from different populations 
® Reject the Null (alpha < .05) & Report the 

magnitude of the differences 




m Virtues of thet-test 


® EVERYONE seems to understand it! 

® With CLT, it’s easy to apply to lots of 
different data scenarios 
® There are other versions that make it 
very flexible 

• Formula for “Repeated Measures” designs 

• Formula for problems associated with non- 
normality and/or variance heterogeneity 
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"! Hypothesis testing Scenari 


L, * 


® The “null” hypothesis for the t-test is that 
the two groups come from the same 
population 

• Thus will have similar means, given sd 
® The “alternative" hypothesis is usually that 

they don't 

• Thus have “different" means, but similar sd 

• Can be directional 

® We use the t-statistic in an attempt to 
Reject the null, supporting our claim of the 
alternative ^aggi 
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Consequences of Hypothesis 
Testing & Alpha 


■lllil 


The Truth is: 


Your decision is: 

You Rejected H 0 Due to a 
Statistically Significant Result 
(Conclude the 2 groups must 
come from different 
populations) 

You Accepted H 0 Due to a 
Non-Significant Result 
(Assume the 2 groups are 
come from same population) 


H 0 Really isTrue 

(there’s no effect) 

Type I Error 

Probability = a 


Probability = 1- a 


H 0 is Actually False 

(there is an effect) 

Power 

Probability = (1-3) 


Type II Error 

Probability = 3 



If you have a “non-significant ' 1 
result: 


The Truth is: 


H 0 Really isTrue 

(there’s no effect) 


H 0 is Actually False 

(there is an effect) 


Given a non-significant t-score comparing means.. 



"iComparing Three Groups 


Fff Limitations of t-tests 


® Alpha risk is .05 for each t-test 

• Probability of falsely rejecting the null, and 
concluding that there is a difference, when 
it's really due to chance. 

• So comparing 3, 4, 5 or more groups is quite 
problematic! 
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‘Comparing Three Groups 


Comparing Three Groups 


T-test number 1 
Alpha risk = .05 


T-test number 1 
Alpha risk = .05 


T-test number 2 
Alpha risk = .05 


T-test number 3 


, (Comparing Three Groups 


Comparing Three Groups 


T-test number 1 
Alpha risk = .05 


T-test number 2 
Alpha risk = .05 


JMtest number 2 
Alpha risk = .05 


T-test number 3 
Alpha risk = .05.. 





'1 Analysis of Variance (ANOVA) 


® Can compare unlimited number of groups 
or occurrences, and still keep alpha risk = 
.05 

® Able to take multiple grouping (or time) 
factors into account and determine their 
independent and combined effects 
® Can examine “trends" in data, and can test 
specific (often complex) hypotheses 
® The analytic focus is on variance, but the 
interpretation falls back to means — thus 
results become intuitive 



: 



N "~!Assumption of Interval or Ratio Scale & Normal 

® The “bell-shaped” curve — assumption of 
all parametric statistics 
® Studies show that ANOVA is robust to 
violations of this, but only if sample size 
is substantially large, and Homogeneity 
is met , . Hi 


•£/ im 


ssumption of Homogeneity of Variant 
Across Groups 


® Variance on the dependant variable should be similai 
across groups 
• Why? 

® Because we’re examining VARIANCE in ANOVA, and 
so we need for variance in each group to be roughly 
similar before we can conclude that any differences that 
we find are attributable to group differences (not mere 
variability differences). 

® Even in Means Comparisons (ex.t-tests), since Means 
are highly affected by variability, we need for variability 
to be similar in our groups so that differences that we 
find can be attributed to true group differences, and n 
merely by variability differences between our groups. 
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More on Homogeneity of Variance 


® If distributions in 1 
group is leptokurtotic 
(tall and skinny), 
then it should be for 
all other groups 



Group 


Group 


Group 







® Any Miss-Match is a 
Problem 

• Because we might interpret a 
statistical differences to real 
group differences, when it’s 
actually due to heterogeneity 
of variance 

® . . .Thankfully there are ways 
to test for this problem, and 
solutions are sometimes 
possible. SPSS will test this 
assumption for us (stay 
tuned) 


I'' ' 


m What about skewed data? 


® Positive or negative skews in the data 
can wreak havoc with statistical analysis 

• Thus always recommend thorough data 
screening 

• Identify outliers — data entry errors? 

• Consider data transformations if necessary 

A great thing to google! 



Hi Common Transformations 


Logarithm 

- 

Reflect and 
/ Logarithm 



i 

Inverse 

A 

Reflect and 



Inverse 

1 

Tabachmck.BIi^&ftdarCs! (1989). Using 

J 

“ fXuthvanete ■. Statistics ■. 2"? Ed: :New York: Harper- 
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ffflTwo General Types of ANOVA 


® Independent Measures ANOVA (IM-ANOVA) 

• Data are collected from separate groups of subjects, and 
comparisons among groups are desired 

Muscle Size by Treatment (controls vs two intervention groups) 
o Blood Flow by Gender 

® Repeated Measures ANOVA (RM-ANOVA) 

• Data are collected from the same group of subjects on 
multiple occasions/times, and comparisons of occasions are 
desired. 

o Longitudinal Studies 

o Outcomes measured at L-10, L-1, L+2, L+5 L+25, R+2... 
Balance Scores Pre Bedrest. During and Post Bedrest 


® Mixed Factorial 

• Mix of IM and RM factors in the same experiments, 
Gender (m,f) by Time (pre. post) effects " 


^One-Way IM-ANOVA 



NASA; 


® For comparing two or more population; 

• Where sample data have been collected 


ft BANOVA: What’s in a Name 


Total Variability 


Vwffhin Groups 
/ Variability 
/•Individual Differences 
(IDs) 
yError 


./Between Groups 
— > Variability: 

•Individual Differences (IDs) 
•Error 

(Real Group Differences 




"! Analysis of Variance F-Rati 

® ANOVA is truly an analysis of a 
measure of variability, called “variance, 
that measures and separates variability 
attributable to 

• Within-Groups Variability 

• Between-Groups Variability 
® We Evaluate an “F-Ratio” Representing 

the Ratio of B/T over W/l: 

P _ variability between groups _ ID‘s + error + group differences A 
variability within groups I D's + error kill 


m Recall your Simple Algebra... ! 

\ 

® If the same quantity exists in the Numerator 
and Denominator of a fraction, they ‘‘cancel 
each other out" \ 


Assuming 
homogeneity 
of variance 





_ variability between yt p 

variability within groups IY^ < +'t?h^>r 




Recall your Simple Algebra... 


® If the same quantity exists in the Numerato 
and Denominator of a fraction, they “cancel 
each other out” 

• Leaving us with a number (F) that 
represents Group Differences! 


I inability between groups IP's ♦ crnC+ jtmup JitTcraivo^ / 
variability within groups ID's -terror f I 



RH Analysis of Variance F-Ratio 


® If F=1 ... tr f' 

® As F increases... p 

® Flow do you know if F is “big enough” to 
considered significant? 

• How do you know a t-test is significant?? 


P _ variability between groups _ IP's + error + group differences 
variability within groups ID's -t error 





Confidence Intervals with the F-test 

® Cl’s for comparing two groups are 
straightforward and intuitive 
® Cl’s for “Omnibus” differences are less so 
• Effect size calculations exist, but less intuitive 
interpretation.. 

® Stay tuned for discussions about post-hoc 
tests, and how they can sometimes help 
® Plots will also be very informative 


IM-ANOVA Summary Table 


® Purpose is to provide the necessary 
components of the F-test 

• Variability (SS) 

• Degrees of Freedom (df) 

• Mean Square (MS) 

• F-statistic (F) 

• Probability values associated with F 

® Total, Between Groups, Within Groups 




' IM-ANOVA Summary Table 


® Purpose is to provide th e necessary 

components of the F-t €T~~ Bum of Squared Deviations from 

• Variability (SS) theMean . 

• Degrees of Freedom (df) 

• Mean Square (MS) 

• F-statistic (F) 

• Probability values associated with F 
® Total, Between Groups, Within Groups 


IM-ANOVA Summary Tables 


® Purpose is to provide the necessary 
components of the F-test 

• Variability (SS) [ Like in a Mest each F ’ test has * 

J v va lues for ^sianificance testing _ 

• Degrees of Freedom (df) 

• Mean Square (MS) 

• F-statistic (F) 

• Probability values associated with F 

® Total, Between Groups, Within Groups 









mam 


NASA: 


This is where it comes from (Independe 
Measures Designs) 


(numerator) and df w 
(denominator). 
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One-way ANOVA example for SMAR 


Levene's test of Homogeneity 
The null for Levene's is that the 
variances are similar, so we do 
_ NOT want to reject this one! __ 













Recall our earlier analytic choices, 





know what’s going on... but 
most graphics programs are 
“drama queens” when they 
select axis ranges! Be 


iff Post-hoc Pairwise Comparisons 


i 



If If we had a-priori contrasts'; 


® We chose all possible pairs to compare post 
hoc, adjusting for the number of comparison 

• UC vs, T reatment A 

• UC vs. Treatment A + Caffeine 

• T reatment A vs. T reatment A+Caffeine 

® If we had a more specific set of comparisons 
that we wanted to make A-priori, we could 
have more statistical power, at the expense 
unnecessary comparisons. / 


m In this example? 


® May make sense to compare Usual care 
to either of the novel Treatments, but not 
to compare the two novel treatments? 

® “Simple” contrasts, with a reference 
category (usual care) 

• Usual Care vs. Treatment A 












"iCake anyone? 

■ 

® You can’t have your cake, and eat it too! 
® Good Science dictates that you either 
HAVE a-priori contrasts, or you DON’T! 

• Contrasts are theory-driven, not something 
that you do “after the fact” 

• Post-hoc tests are more appropriate if you 
want all possible pairs of comparisons 



Next Time: Two-Factor 
ANOVAS 

® What if you want to compare 2+ groups 
on MORE THAN one factor? 

• Effect of subjects’ gender and Treatment on 
BMD? 

• Effect of Novel Treatment (vs. control) and 
Implementation Schedule (two types) on 
Countermeasure’s Effectiveness? 











Recap 

® Analysis of Variance (ANOVA) examines 
variability between groups, relative to withi 
groups, to determine whether there’s 
evidence that the groups are not from the 
same population 

® Analysis focuses on variance, but 
interpretation is about mean’s 

® One-way ANOVA compares more than two 
groups. 

• Similar to a t-test, but for 3, 4, 5+ groups f 



i 
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Recap 

® ANOVA assumes 

• Random samples from the population 

• Sufficiently large enough n to detect effects, 
distributed evenly among groups 

• Similar variability among groups (Homogeneity of 
Variance) 

® We should examine our data and test our 
assumptions 

• Sometimes we need to consider data 
transformations to meet these assumptions 

• Sometimes we need to rely on robust aitematiy&o 
to the typical ANOVA statistic 


Recap 


® ANOVA results summarized in a ANOVA 
table, with an “Omnibus F-statistic” and p- 
value 

• Represents the ratio of between/within variability 

• If significant, reject the null hypothesis that the 
groups are from the same population 

® Researchers typically follow-up a significant 
F-ratio with either j 

• Post Hoc tests f 

• A-Priori Contrasts / 






jlJToday... Multifactorial ANOVA 

® What if you want to compare 2+ groups on 
MORE THAN one factor? 

• Effect of subjects’ gender and Treatment on BMD? 

• Effect of Novel Treatment (vs. control) and 
Implementation Schedule (two types) on 
Countermeasure’s Effectiveness? 

• Effect of Suit Pressure (3 settings) and Glove 
Design (2 types) on EVA performance? 

® Still working with completely Independent 
Measures Designs 

• Subjects in one “cell" are not also in any other 


Table Representation of Experiments 
Two-Factor Designs 


3x2 design 
Study n=120 

No Drug 

Low Dose 

High Dose 

Drug A 

n=20 

n=20 

n=20 

Drug B 

n=20 

n=20 

n=20 


3x3 design 
Study n=180 

No Drug 

Low Dose 

High Dose 

No Therapy 

n=20 

n=20 

n=20 

Therapy A 

n=20 

n=20 

n=20 

Therapy B 

n=20 

n=20 

n=20 


2x2 design Control Inte 
Study n=78 

Males n=20 n= 




More Complicated Design 

® ANOVA can handle 3, 4, 5, or even more 
factors! 

• “k” is the common notation for number of factors 
an ANOVA design 

® But be careful what you ask for. . . stay tuned 


2x2x2 design 


^lacebo 

Experimental Drug 

Study n=152 






Chronic 

Acute 

Chronic 

Acute 


Use 

Administration 

Use 

Administration 

Males 

n=20 

n=18 

l 

CO 

n=1.7- - " / 

Females 

n=19 

n=21 

n= 1 8 

n=20 l t 




Main Effects and InteractL. 


® Main Effects 

• One per factor. . . an F-statistic evaluating the impac 
each factor in the model 

Gender effect on performance (M/F diffs?) 

Race/ethnicity effect on performance 

® Interaction Effects 

• One per interaction... an F-statistic evaluating how tw 
(or more) factors interact with one another to affect th 
outcome 

Gender “by" Race/Ethnicity interactive effects on performa 

More complex. . . often more interesting! 



m Interactions. 


® Interaction effects are often the most 
interesting, but can be tricky to understand at 
first 

® We describe them in terms of how many 
factors are involved 

• “Two-way” means two factors work together to 
explain the observed difference 

• “Three-way” means that three factors tell the story 

• “Four-way” means that you'd better have some J 

pain relievers nearby! / 


Two-Factor Example 


® Compare Performance Gains Following 
Exercise Intervention by Subjects’ Initial 
Fitness Status 

• Subjects' Current Fitness Level Upon Enrollment 
in Study 

o Couch Potatoes, Fit Individuals 

• Intensity of Exercise Program 

o Low, Medium, High /j 

® 2x3 ANOVA / 









n*Sa; 


jp First Example 

® Use “Improvement” as our outcome variable 





























The Analysis ( the easy way i 


Table of mean: 


Anyone remember what the Leve 
tells us?? 
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The ANOVA Summary Table: 
-interaction Effect? YES!!! 
->Main Effect for Fitness Level? 
->Main Effect for Exercise Intensity? 
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The ANOVA Summary Tabl 


->lnteraction Effect? YES!!! 
->Main Effect for Fitness Level? Ignore! 
->Main Effect for Exercise Intensity? 
Ignore! 


And when there's an Interaction 
effect, all Main Effects involved 
should be ignored. They are 
qualified by ' an Interaction Effect 
that tells more of the story. 





Interaction Plots 

® Lines that aren’t parallel suggest an 
Interaction Effect 

• One line is different from the other 

® Sometimes the lines cross over one another 

forming an “X,” “zig-zag” or other 

• Dramatically different effects 


« 

‘ 
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Hi We have an interaction, Now 


® A-priori hypotheses rule the day! 

• If you had 'em, test 'em! 

® If not (more typically), then many will argue 

that you are finished with analyzing the data 

• Plot the means (albeit better than SPSS does by 
default), report the Interaction effect, and explain. 

® Others argue that you are justified for runnin 
additional analyses to look for potential 
differences in one factor, nested within le' 
of the other 
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| Our Study? 

® Let’s assume we thought there would be a 
difference in effects of low, medium, high \ 
intensity by fitness levels 
® Let’s further assume that we did not have a- 
priori hypotheses begging for specific 


contrasts, but rather wanted to follow-up with 
whatever post-hoc tests we are justified at 
running 












HWrite-up? 


...Our analysis revealed a significant Fitness level by Exercise 
Intensity Interaction Effect (p< 0.01). Mean data reveal how the 
differential effect of Low, Medium, or High Intensity Exercise 
programs affected Couch Potatoes differently than Fit Individuals 
(see fig). 

(You could stop here and discuss some theory explaining why this 
may be the case You could also continue with the simple effects 
if you think that it further illuminates important findings.) 




...Our analysis revealed a significant Fitness level by Exercise Intensity Interaction 
Effect (p< 0.01 ). Mean data reveal how the differential effect of Low, Medium, or 
High Intensity Exercise programs affected Couch Potatoes differently than Fit 
Individuals (see fig). 

Given the significant interaction, we also ran simple effects contrasts (Bonferroni 
adjusted) comparing the pairwise effects of Low, Medium and High Intensity 
exercise interventions within Fitness level. Couch Potatoes randomized to the 
High-Intensity intervention showed significantly greater gains relative to Couch 
Potatoes in the Low-Intensity condition (p<.01). Neither of these groups differed 
from the Medium-Intensity Couch Potatoes group, though the Low-Intensity Couch 
Potatoes were marginally significantly different from their Medium-Intensity 
counterparts (p=.055). 



IWrite-up? 


\ 


... Given the significant interaction, we also ran simple effects contrasts 
comparing the pairwise effects of Low, Medium and High Intensity exercise 
interventions within Fitness level. Couch Potatoes randomized to the High- 
Intensity intervention showed significantly greater gains relative to Couch Potatoes 
in the Low-Intensity condition (p<.01). Neither of these groups differed from the 
Medium-Intensity Couch Potatoes group, though the Low-Intensity Couch Potatoes 
were marginally significantly different from their Medium-Intensity counterparts 
(p=.055). 

In contrast, the pairwise comparisons with the Fit Individuals groups revealed 
significant and clear benefits of those randomized to High-Intensity exercise 
relative to Low (p<.01), and those in the High versus Medium Intensity groups 
(pc.01). The marginal effect observed among Couch Potato Low versus High 
subjects was not evident in Fit Individuals randomized to either of these exercise 
interventions. ki 
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Next? 


Results.. .scrolling down 


® Let’s try it again using “lmprovement2” 
instead of “Improvement” as our observed 
results. 


The ANOVA Summary Tab 
-> Interaction Effect 


->Main Effect for Fitness Level 
->Main Effect for Exercise Intensit 


® This is for illustration purposes — pretend like 
these were our data instead of the earlier 
results... 

• The analysis set-up is the same. 


Results? 


Results.. .scrolling down 


Table of means & sd. 


What is the Levene test telling us this time? 




Interpreting Multi-Factorial ANOVA 
Results \ 

® If you have an Interaction Effect, Start There! 

• All main effects involved in a significant interaction 
are qualified effects anyway — they don't tell the 
whole story 

• This was the case in our earlier example 

® If you do not have Interaction Effects, 

Interpret whatever Main Effects you have 

• This is the case in our current example 

Post-hoc or Contrast effects may be useful now? / 


EE 


rif Back to our Example. 


® Main effect for Fitness Level 

• Couch Potatoes improved more 
than Fit Individuals 

® Main effect for Intensity of 

the Exercise 

• Looks like increasing intensity 
produced greater benefits 
overall 

• FOIIOW-UP? npnpnripnt Variahlf 





Remember when we skipped pos 
tests? 

® Now that we know we \ 
don’t have an Interaction x 
effect, it’s time to 
consider post-hoc tests, 
if you desire 

• Compare All Intensity 

Levels Pairwise, collapsing 
across Fitness Level 

® Why not contrast comparisons of the levels of one 
factor “within” the other? 





Remember when we skipped pos 
tests? 

® Now that we know we \ 
don’t have an Interaction x 
effect, it’s time to 
consider post-hoc tests, 
if you desire 

• Compare All Intensity ' — 

Levels Pairwise, collapsing 

across Fitness Level 

® Why not contrast comparisons of the levels of one 
factor “within” the other? 

• No Evidence that the differences between Low, Med, 
are any different in Couch Potatoes versus Fit People 
















jfJ|Today: Repeated 


Measures ANOV, 


® With RM ANOVA, we consider measuring 
SAME subjects at different times, or under 
different conditions, to see if something 
changes over time, or between the different 
conditions 

• Ex. Does performance decrease in response to 
time spent in microgravity? 

• Ex. Does bone mass decrease as subjects age? 

• Ex. Compare Subjects' Ratings of ‘XYZ' when 
taking Placebo, versus Drug A, versus Drug B, 
with adequate washout periods, between drugs . 

• Ex. Compare PRE to Post"! and Post2... / I 


k I 
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'^Repeated Measures Designs 


® Only one sample 
® Differences based on 
time, or condition 
® Using the SAME 
subjects time after 
time 

® Measuring the SAME 
outcome each time 
® Looking for 
changes... 


'During- 



A 


NASA; 


■Repeated Measures ANO\A 


® Same people... no 
“individual differences” in 
the F-ratio 

® More powerful statistics 


® The F-Ratio represents: 


Variability among times (or conditions) error + time (or condition) differences; 


Variability within the sample 


■Repeated Measures ANOV 


® Same people... no 
“individual 

differences” in the F- 
ratio 

® More powerful 
statistics 


Variability among times (or conditions) 


time (or condition 


Variability within the sample 


^Assumptions for RM ANOVA 


® Same as for IM-ANOVA RE Ordinal or 
Continuously Scaled Outcomes following th 
normal distribution 

® Random sampling from the population with 
sufficient n 

• Except now only one sample... 

® Homogeneity of Variance Does Apply in 
purely RM models, (only 1 group!) 

® Instead, Assumption of Sphericity 

• Assume that the covariance among pairs of / 

repeated observations are equal. ,• 


k ' A 


m RM-ANOVA Summary Table 

® Same Concept as IM Table, but now 

• Instead of “Between Groups” effects, we have 
"Between Treatments" effects 

• And also “Within Treatments” 

Consist of subject differences (among subjects) 

And error 

® One Group measured several times, thus we 
partition “within group” variability into that which 
is due to individual differences, and error. 




This is where it comes from 
(Repeated Measures Designs) 


k=number of 
times 
measured 


Same as IM Anova 
= Same as IM Anova 
= Same as IM Anova 


’ between 


within 


each person's total across treatments 


' b 1 1 subjects 


within 


b / 1 subjects 


between 


>~hff subjects 


s«jThis is where it comes from 
1 ^(Repeated Measures Designs) 


MS, 


SS , 


between 


between 


4fb 


between 


CC 

— error 
1V1 ^> error ~ „ 

./ error 


F 


MS, 


between 


MS,, 


F-tables provide a p value 
given F-statistic, using df, 
(numerator) and c 
(denominator) 
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Example 


® Compare Performance Pre, During, and Post 
Bed-Rest 

• Pre (time zero) \ 

• Three Weeks Into Bedrest \ 

• Six Weeks Into Bedrest (end of bedrest) 

• Three Weeks FOLLOWING Bedrest (week 9) 

® Same Subjects measured 4 times 
® Equal Interval between time periods 



In This Example... 


® Subjects (n=34) were measured on some validate 
performance scale four times, with equal intervals 
between time periods 

• Pre, Week 3, Week 6, Week 9. 

• Bedrest STOPPED at the end of Week 6 

• (Dataset created for instructional purposes) 

® Prior Research has shown that these data tend to 
follow the normal distribution 




"Examples of Long Dataset 


Long format, with as many 
rows per subject as needed 
to capture all of the repeated 
observations 
(same data as prior slide) 


SPSS requires Wide format 
for Repeated Measures 
Designs. 


Subjects with ANY missing 
observation will be completely 
ignored in a purely repeated 
measures (fixed) ANOVA 


This gets tricky when there 
are two repeated measures 
factors... stay tuned! 


This can be a big problem with 
small n! We have n=34 here, but 
4 are missing at least one 
observation... so Study n=30 
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RM_<M7VA M.(DaUS*1) SPSS SUfnUr 


: KM M»JI« t«r(pa1aSrti| SPSS SUIhlfat Data El 


Options for RM Designs. 


This is where you tell SPSS 
which columns represent the 
four repeated observations. 
Be sure to bring them over 
in the CORRECT ORDER! 
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pffl|The easy way.. .with Syntax 


OATASET ACTIVATE CutaSdl 


Interpreting the Output 











NASA! 


iff Interpreting the Output 


Interpreting the Output 


Interpreting the Output 


® Performance started high, dropped during bedrest, 
and seems to have returned by recovery. 

® Different Options on how to follow-up with our 
significant ANOVA telling us that “things changed" 


Mauchly's test tells us whether we met the 
assumption of sphericity. If significant, we violated 
this assumption and need to adjust our F-statistic 
accordingly (stay tuned!) 













"What next? 


k 
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® Like with the Oneway IM-ANOVA, we 
probably had a more in-depth research 
question in mind, other than "did things 
change?” Did we want to: 

• “Characterize the nature of the change?” 

• “Compare “Pre-" levels to all “Post- levels” and 
report on our findings?” 

• “Compare everything to everything else and hop 
that something, anything, is significant so that 
can get a paper outta this study???” 
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Back to our Output. 
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Another Example 



Want to know if BMD 
changes during 
bedrest are affected 
by an intervention, so 
we have 2 groups 
(Control, Intervention) 
and we measured 
their BMD Pre- 
bedrest, then again at 
30, 60 and 90 days. 

Want to know if the 
change over time is 
less (better) in the 
Intervention group 
relative to controls. 
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Next Time 

® Meet again at noon, Thursday, Sept. 24 th 
® Begin reviewing Hypothesis Testing using 
ANOVA, Regression, or Other topic per today 
® PPT Slides & “Screenshots” from Statistical 
Software 

• Promise... no hand calculations & minimal 
formulae! 

• Promise... fun & applied, with enough “meat” to get 
you started and keep you statistically-safe 

Or at least enough to know when it’s time to call usl! 








